Multivariate Geographic Clustering Using aBeowulf - style Parallel
نویسنده
چکیده
The authors present an application of multivariate non-hierarchical statistical clustering to geographic environmental data from the 48 conterminous United States in order to produce maps of regions of ecological similarity called ecore-gions. Nine input variables thought to aaect the growth of vegetation are clustered at a resolution of one square kilometer. These data represent over 7.8 million map cells in a 9-dimensional data space. For the analysis, the authors built a 126-node heterogeneous cluster|aptly named the Stone SouperComputer|out of surplus PCs. The authors developed a parallel iterative statistical clustering algorithm which uses the MPI message passing routines , employs a classical master/slave single program multiple data (SPMD) organization, performs dynamic load balancing, and provides fault tolerance. In addition to being run on the Stone Souper-Computer, the parallel algorithm was tested on other parallel platforms without code modiication. Finally, the results of the geographic clustering are presented.
منابع مشابه
Multivariate Geographic Cluster Using a Beowulf-style Parallel Computer
The authors present an application of multivariate non-hierarchical statistical clustering to geographic environmental data from the 48 conterminous United States in order to produce maps of regions of ecological similarity called ecore-gions. Nine input variables thought to aflect the growth of vegetation are clustered at a resolution of one square kilometer. These data represent over 7.8 mill...
متن کاملMultivariate Spatio-Temporal Clustering of Time-Series Data: An Approach for Diagnosing Cloud Properties and Understanding ARM Site Representativeness
A multivariate statistical clustering technique— based on the iterative k -means algorithm of Hartigan (Hartigan, 1975)—has been used to extract patterns of climatological significance from 200 years of general circulation model (GCM) output. Originally developed and implemented on a Beowulf-style parallel computer constructed by Hoffman and Hargrove from surplus commodity desktop PCs (Hargrove...
متن کامل9.4 Using Clustered Climate Regimes for Understanding Water Cycle Variability
A multivariate statistical clustering technique— based on the iterative k -means algorithm of Hartigan (Hartigan, 1975)—has been used to extract patterns of climatological significance from 200 years of general circulation model (GCM) output. Originally developed and implemented on a Beowulf-style parallel computer constructed by Hoffman and Hargrove from surplus commodity desktop PCs (Hargrove...
متن کاملMultivariate Spatio-Temporal Clustering of Times-Series Data: An Approach for Diagnosing Cloud Properties and Understanding ARM Site Representativeness
A multivariate statistical clustering technique—based on the iterative k-means algorithm of Hartigan (Hartigan 1975)—has been used to extract patterns of climatological significance from 200 years of general circulation model (GCM) output. Originally developed and implemented on a Beowulf-style parallel computer constructed by Hoffman and Hargrove from surplus commodity desktop PCs (Hargrove et...
متن کاملMultivariate Spatio-Temporal Clustering (MSTC) as a Data Mining Tool for Environmental Applications
The authors have applied multivariate cluster analysis to a variety of environmental science domains, including ecological regionalization; environmental monitoring network design; analysis of satellite-, airborne-, and ground-based remote sensing, and climate model-model and model-measurement intercomparison. The clustering methodology employs a k-means statistical clustering algorithm that ha...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2011